Identity security architecture systems and methods

ABSTRACT

Embodiments of various systems and methods described herein provide an identity security database analytics system which is configured to provide security alerts to a user. The security alerts can include for personalized metrics related to potential identity theft incidents. The personalized metrics can include user specific information on security breaches of the user&#39;s personal information as well as depersonalized statistics generated based on information of other users having one or more similar characteristics of the user.

PRIORITY CLAIM

This application claims the benefit of priority under 35 U.S.C. § 119(e) of U.S. Provisional Application No. 62/565,014, filed on Sep. 28, 2017, the entire contents of which is hereby incorporated by reference in its entirety herein and should be considered part of this specification.

LIMITED COPYRIGHT AUTHORIZATION

A portion of the disclosure of this patent document includes material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyrights whatsoever.

FIELD OF THE DISCLOSURE

The present disclosure generally relates to information security such as detecting and alerting an identity theft in a computer environment.

BACKGROUND OF THE DISCLOSURE

Identity theft is one of the largest and expanding forms of crime in the world. Identity theft is a crime in which an imposter obtains key pieces of information, such as social security numbers, driver's license numbers, email addresses, and so forth, and uses it for his or her improper personal gain. The imposters can obtain new accounts; re-direct the mail and telephone calls from existing accounts; sign up for unwanted and often expensive services; order subscriptions; order and take delivery of products; and otherwise “become” the individual whose identity has been stolen, minus the conscience and fiscal responsibility.

SUMMARY OF EXAMPLE EMBODIMENTS

Various systems and methods described herein provide a database analytics system which is configured to provide alerts to a user for personalized metrics related to potential identity theft incidents. The alerts can include user specific information on security breaches of the user's personal information as well as depersonalized statistics generated based on information of other users having one or more similar characteristics of the user.

In one embodiment, an identity security system comprises a non-transitory data storage configured to store computer executable instructions for an identity data analysis system; a dark web data store configured to store compromised data related to users' personal identifying information (PIT) and aggregated statistics calculated based on the compromised data; a plurality of identity alerts data stores configured to store alerts sent to the users and metadata associated with the alerts; an identity metrics data store configured to store depersonalized summary statistics. The hardware processor can be programmed to execute the computer executable instructions in the non-transitory data storage to cause the identity security system to: receive an instruction to generate an alert comprising personalized metrics of a user, wherein the instruction comprises PII of the user; query a dark web data store with the PII of the user to acquire information on security compromises related to the PII of the user; query an identity alerts data store with geographic location information of the user to obtain report statistics associated with a geographic region of the user; query an identity metrics database to obtain the depersonalized summary statistics, wherein the depersonalized summary statistics are generated based on a group of individuals whose PIIs overlaps at least with a portion of the user's PII; generate an alert which comprises information on security compromises related to the PII of the user, report statistics associated with a geographic region of the user, and depersonalized summary statistics; and deliver the alert to a client computing device via a network.

In another embodiment, a method for protecting identity security can comprise: receiving an instruction to generate an alert comprising personalized metrics of a user, wherein the instruction comprises PII of the user; querying a dark web data store with the PII of the user to acquire information on security compromises related to the PII of the user; querying an identity alerts data store with geographic location information of the user to obtain report statistics associated with a geographic region of the user; querying an identity metrics database to obtain depersonalized summary statistics, wherein the depersonalized summary statistics are generated based on a group of individuals whose PIIs overlaps at least with a portion of the user's PII; generating an alert which comprises information on security compromises related to the PII of the user, report statistics associated with a geographic region of the user, and the depersonalized summary statistics; and delivering the alert to a client computing device via a network.

In yet another embodiment, non-transitory computer readable medium storing computer executable instructions thereon, the computer executable instructions when executed can cause an identity security system to: receive an instruction to generate an alert comprising personalized metrics of a user, wherein the instruction comprises PII of the user; query a dark web data store with the PII of the user to acquire information on security compromises related to the PII of the user; query an identity alerts data store with geographic location information of the user to obtain report statistics associated with a geographic region of the user; query an identity metrics database to obtain depersonalized summary statistics, wherein the depersonalized summary statistics are generated based on a group of individuals whose PIIs overlaps at least with a portion of the user's PII; generate an alert which comprises information on security compromises related to the PII of the user, report statistics associated with a geographic region of the user, and the depersonalized summary statistics; and deliver the alert to a client computing device via a network.

Although certain embodiments and examples are disclosed herein, the subject matter extends beyond the examples in the specifically disclosed embodiments to other alternative embodiments and/or uses, and to modifications and equivalents thereof. The systems, methods, and devices of the disclosure each have several innovative aspects, no single one of which is solely responsible for the desirable attributes disclosed herein.

BRIEF DESCRIPTION OF THE DRAWINGS

Throughout the drawings, reference numbers are re-used to indicate correspondence between referenced elements. The drawings are provided to illustrate embodiments of the subject matter described herein and not to limit the scope thereof.

FIG. 1 illustrates an embodiment of a computing environment for an identity management system.

FIG. 2A illustrates an example embodiment of an identity management system for collecting data used to generate personalized metrics.

FIG. 2B illustrates example embodiments of data collection processes.

FIG. 3A illustrates an example embodiment of a metrics calculation system for generating personalized metrics of a user.

FIG. 3B illustrates an example embodiment of data synchronization between an identity metrics data store and a dark web data store.

FIG. 3C illustrates an example embodiment of calculating identity statistics based on data from a dark web data store.

FIG. 3D illustrates an example embodiment of synchronizing the identity metrics database with a plurality of report and alert metadata databases.

FIG. 3E illustrates example embodiments of statistical computation processes.

FIG. 4A illustrates an example embodiment of a data flow diagram for report generation.

FIG. 4B illustrates example embodiments of reports delivered to a client computing device.

FIG. 5A illustrates an example embodiment of an identity metrics analysis process.

FIG. 5B illustrates an example embodiment of an identity monitoring process.

FIG. 6 illustrates an embodiment of a computing system which may implement example embodiments of an identity management system.

DETAILED DESCRIPTION OF VARIOUS EMBODIMENTS Overview

With the spread of computer and Internet usage, there is a growing concern about the increased risk of identity theft. Individuals often reveal their personally identifiable information (PIT) on the Internet when conducting transaction or interacting with various websites. The PII can include information that can be used to uniquely identify, contact, or locate an individual person or can be used with other sources to uniquely identify, contact, or locate an individual person. PII may include, but is not limited to, social security numbers (SSN), bank or credit card account numbers, passwords, birth dates, and addresses. Imposters can acquire the PII from various sources, such as the Internet, and take advantage of sensitive personal information, such as, for example, by committing fraud with the stolen identity. As an example, it is commonplace for PII to be compromised and utilized for identity theft in a remote commercial transaction which involves little or no direct personal contact between a consumer and a goods or services provider.

Identity theft resulting from compromised PII (such as the PII obtained without proper authorization) is costly to victims and companies alike. The Identity Fraud Survey Report created by Javelin Strategy & Research reported that in 2009 victims averaged a personal cost of $373 and 21 hours of time to resolve identity-theft issues. The annual cost of identity theft currently exceeds $200 billion worldwide. In addition, as a result of new legislation and litigation resulting from compromised PII, companies stand to suffer from lower profit margins, damaged credibility due to negative customer experiences, and eroded brand value. Identity theft also looms as a threat to the advancement of promising consumer-driven, self-service, and cost-savings technologies.

Identity theft can be discovered using monitoring platforms which conduct automated inquiries and searches of a variety of third party sources to locate matches to a person's information and providing an automated alert if an anomaly is detected. For example, an identity management system can alert a user where suspicious use of its PII is discovered, and can allow them to remediate the damage of the stolen information if an identity theft occurs. However, the rate on how often a person receives the identity alert is typically low. For example, a person can subscribe to an identity monitoring service but only get one to two alerts during the entire subscription. As a result of the low alert frequency, the person may not know if there are no alerts or if their account has been compromised.

Thus, it is important that an identity monitoring service increases the frequency of the alerts but yet continues to provide relevant and valuable information in the alert without disclosing the PII of other persons. Advantageously, in some embodiments, the identity management system described herein can generate relevant alerts for a user more frequently. For example, the identity management system can utilize a web-based Application Programming Interface (API) to receive a user's personal and demographic information from a client computing device. The API can generate personalized metrics for the user and return an alert including the personalized metrics. The personalized metrics can include identity metrics specific to the user's PII, such as the number of hits found for user's PII in a security breach (for example, the number of hits of the user's email address in a search of dark websites). Advantageously, in some embodiments, the personalized metrics can also include depersonalized identity metrics such that the statistics of the identity metrics are based on a group of users whose identities are not revealed to the user. Some example of depersonalized identity metrics can include the number of alerts that have been generated for other monitored consumers with in the same geographical region of the user, the total number of compromised email addresses where the number of compromised email addresses matches the user's email address domain, the total number of compromised credit cards where the credit cards are from the same issuer as the user's credit card, the total number of comprised phone numbers having the same area code as the user's phone number, and so forth. The personalized metrics can be calculated over different time horizons. For example, the personalized metrics can be calculated based on historical total, last 5 days, last 60 days, last 3 months, or other time durations.

The personalized metrics can be calculated based on statistics generated using dark web data (which may include compromised personal information) and historical identity monitoring alerts associated with other individuals (and/or the user). The statistics can further be refined or filtered to generate personalized metrics based on the person's personal and demographic information. For example, the depersonalized statistics may be filtered to identify the number of notifications sent to users within the same geographic areas as the user whose identity metrics are calculated. Accordingly, a user can learn the likelihood that his personal information is stolen based on compromised data from similar individuals.

As will be appreciated by one skilled in the art, there are numerous ways of carrying out the examples, improvements, and arrangements of the identity management system in accordance with the embodiments disclosed herein. Although reference will be made to the illustrative embodiments depicted in the drawings and the following description, these embodiments are not meant to be exhaustive of the various alternative designs and embodiments that are encompassed by the disclosed embodiments. Those skilled in the art will readily appreciate that various modifications may be made, and various combinations can be made, without departing from the disclosure.

For purposes of this disclosure, certain aspects, advantages, and novel features of various embodiments are described herein. It is to be understood that not necessarily all such advantages may be achieved in accordance with any particular embodiment. Thus, for example, those skilled in the art will recognize that one embodiment may be carried out in a manner that achieves one advantage or group of advantages as taught herein without necessarily achieving other advantages as may be taught or suggested herein.

Computing Environment of Identity Management System

FIG. 1 illustrates an embodiment of a computing environment for an identity management system. The identity management system 130 can be in communication with a client computing device 110 via a network 108. For example, client computing device 110 can generate and send request messages to obtain a user's identity metrics or a request to monitor a user's identity metric to the identity management system 130. The identity management system 130 can calculate identity metrics for the user based on data associated with individuals similar to the user (such as, for example, data obtained from dark web sites) as well as data particular to the user's PII. The identity management system 130 can generate and return an encrypted report package which includes the identity metrics to the client computing device. To simplify discussion and not to limit the present disclosure, FIG. 1 illustrates only one client computing device 110 and one identity management system 130, though multiple systems may be used.

Client Computing Device

The illustrated client computing device 110 can implement a web portal or application to communicate with the identity management system 130. For example, the client computing system 110 can host a software application which includes an API for requesting and receiving a user's identity metrics. As another example, the client computing device 110 can communicate with the identity management system 130 via a browser application. The client computing device 110 can request and receive reports including identity metrics via HTTP protocols.

The client computing device 110 can be associated with an identity monitoring entity 112 or a user 114. The identity monitoring entity 112 may be a company or a software suite which offers identity monitoring or reporting to an individual. The identity monitoring entity 112 may also be an entity interested in obtaining another individual's information (such as for example, where a company runs a background check of an employee candidate). A user may be an individual whose identity metrics are calculated and reported.

In some embodiments, a user (for example, the user A 114) can send an electronic request for a report of his identity metrics. A user can communicate directly with the identity management system 130 to request a report. For example, as illustrated in FIG. 1, the user A 114 can request his identity metrics from the identity management system 130 via the network 108 or enroll in an identity monitoring service without going through the identity monitoring entity 112.

The identity monitoring entity 112 can act on behalf of a user to obtain the user's information. For example, the identity monitoring entity 112 can provide an online portal which allows the user B 116 to input his PII. The identity monitoring entity 112 can use the user B's PII to request personalized metrics for user B. Based on the PII provided by the identity monitoring entity 112, the identity management system 130 can return a report comprising the personalized metrics for user B 116. As further described with reference to FIGS. 4A and 4B, the personalized metrics may be generated based on data specific to the user's PII or data from individuals similar to the user B 116.

As another example, the identity monitoring entity 112 can subscribe on behalf of the user B 116 for monitoring the user B's 116 identity. Once the identity management system 130 receives the user B's PII, the identity management system 130 can register the user B's information and periodically sending personalized metrics to the identity monitoring entity 112 or directly to the user B. The personalized metrics may be delivered as an electronic alert (which is also referred to as a report), for example, via an email, a mobile application, a text message, and so on.

The client computing device 110 can implement one or more components of the computing system 600 as described with reference to FIG. 6. The modules 609 of FIG. 6 can include software executable code for receiving PII from a user (for example, user B 116 or user A 114), and for communicating to the identity management system 130 a request for a report or a request to enroll a user into periodic monitoring of the user's identity metrics. The software executable code can also include routines for delivering or displaying a report to a user (for example, user B 116).

Network

The network 108 can comprise one or more of local area network (LAN), wide area network (WAN), and/or the Internet, for example, via a wired, wireless, or combination of wired and wireless communication links. The network 108 may communicate with various computing devices and/or other electronic devices via wired or wireless communication links. The network 108 may be used to share resources and may be used for the analog and/or digital transfer of data and information. In some embodiments, the network 108 can be used to enable multiple devices access to a single system, enable file sharing across the network 108, share software programs on remote systems, or make information easier to access and maintain among network users.

A remote system or device may include data, objects, devices, components, and/or modules not stored locally, that are not accessible via the local bus. Thus, remote devices may include a device which is physically stored in the same room and connected to the user's device via a network. In other situations, a remote device may be located in a separate geographic area, such as, for example, in a different location, country, and so forth.

Although not specifically illustrated in FIG. 1, the user B 116 can also communicate with the identity monitoring entity 112 via the network 108 and the identity management system 130 can also acquire comprised data from the dark web sites 120 via the network 108.

Identity Management System

The identity management system 130 illustrated in FIG. 1 includes a client gateway 162, an identity monitoring system 164, a user identity registration system 166, an identity alerts data store 144, a dark web data store 142, a data acquisition system 130, and an identity data analysis system 150. In various embodiments, one or more of these systems may be combined into a single system or one or more systems may be part of another system. For example, the identity monitoring system 164 and the client gateway 162 may be combined as one system. As another example, the user identity registration system 166 may be part of the identity monitoring system 164. The identity management system 130 can also include other components not illustrated in FIG. 1. As an example, the identity management system 130 may include a report and alert data acquisition system 230 configured to calculate report statistics associated with reports sent to the users of the identity management system 130.

Client Gateway, Identity Monitoring System, and User Identity Registration System

The client gateway 162 can be configured to receive a request for personalized metrics from the client computing device 110. The request may include PII of a user. The client gateway 162 can parse the request to extract PII and feed the PII into the identity data analysis system 150 to obtain a report comprising the personalized metrics for the user. The client gateway 162 can pass the report back to the client computing device 110 for delivery or display to a user. In some embodiments, the identity data analysis system 150 can pass the report directly back to the client computing device 110 via the network 108.

In some embodiments, the request from the client computing device 110 may be a subscription request which includes a request to subscribe a user to receiving periodic reports of the user's personalized metrics. The client gateway 162 can determine that the request includes a subscription request, such as by parsing the electronic request data package, and forward the request to the identity monitoring system 164 for further processing (such as storing the user's PII in a data store or registering the user to the period monitoring).

In some embodiments, if the client computing device 110 can send the subscription request directly to the identity monitoring system 164 without first routing through the client gateway 162. The identity monitoring system 164 can be configured to receive a subscription request for subscribing a user to a monitoring service where the identity management system 130 periodically calculates the personalized metrics for the user and delivers the personalized metrics to the user. For example, the identity monitoring system 164 can receive a request to subscribe a user from the client computing device 110. The request can include the user's PII. The identity monitoring system 164 can extract the PII from the request and pass the PII to the user identity registration system 166.

The user identity registration system 166 can initiate storage of the user's PII into the identity alerts data store or another data store configured to store subscribers' information. The user identity registration system 166 can further associate the user's PII with periodic monitoring and calculation of personalized metrics. The identity monitoring system 164 can periodically cause the identity data analysis system 150 to calculate personalized metrics and pass the periodic calculation to the client computing device as a report. For example, the identity monitoring system 164 can periodically send a request for a user's personalized metrics to the identity data analysis system 150. As another example, the identity monitoring system 164 can periodically send the request to the client gateway 162 which can pass the request to the identity data analysis system 150 for calculating the user's personalized metrics.

Although the identity monitoring system 164 and the user identity registration system 166 are illustrated as two separate blocks, the user identity registration system 166 may be part of the identity monitoring system 154. Thus, the identity monitoring system 164 can be configured to provide periodic monitoring service to the identity monitoring entity 112 and a user A 114, where the identity monitoring system 164 can subscribe a user to a periodic monitoring of his personalized metrics, periodically calculate the user's personalized metrics, and deliver the personalized metrics to the client computing device 110.

In some embodiments, the client gateway 162 and the identity monitoring system 164 may be combined to become one system which can be configured to handle requests from the client computing device 110 and provide reports to the client computing device 110. For example, the combined system may receive a request for personalized metrics or a request to subscribe a user to periodic alerts related to the personalize metrics. The combined system can pass the PII associated with the request to the identity data analysis system 150 to retrieve the personalized metrics. The combined system can also register a user for periodic monitoring related to his identity information and periodically communicate with the identity data analysis system 150 to retrieve the personalized metrics.

Dark Web Data Acquisition System

The illustrated dark web data acquisition system 140 can be configured to identify, acquire, and analyze data associated with PII that may be compromised. PII can be compromised in a myriad of ways. Record keeping for entities such as, for example, healthcare, governmental, financial, and educational institutions, is increasingly and sometimes exclusively electronic. Electronic record keeping introduces new risks for which the entities are frequently ill-equipped to handle. For example, PII is often compromised via stolen hardware, inadequate security procedures, security breaches, or employee carelessness or misconduct.

Another way that PII is frequently compromised is via phishing. Phishing is the process of attempting to acquire PII by masquerading as a trustworthy entity in an electronic communication. A common example of phishing is a fraudulent email that is made to appear as though it originates from a valid source such as, for example, a national bank. The fraudulent email may incorporate a uniform resource locator (URL) that re-directs its target to a false website that appears to be a legitimate website for the valid source. In actuality, the false website may be a front for stealing PII as part of a spurious transaction. For example, the false website may request a confirmation of PII such as, for example, a credit card number or a username and password. The PII may then be stored for later improper use such as, for example, identity theft in a remote commercial transaction.

The dark web data acquisition system 140 can acquire data associated with PII and determine whether the PII is compromised by scanning dark web sites. The data associated with compromised PII can also be referred to compromised data. The dark web data acquisition system 140 can utilize search engines, web spiders, keyword-matching features, data filters, and so forth to uncover information associated with PII. For example, the search engines and the web spiders may be utilized to collect identity-theft information, such as, for example, potential sources of compromised PII. The potential sources of compromised PII may include, for example, websites and forums that facilitate exchange of compromised PII (for example, by identity thieves). Further, keyword-matching features may be leveraged to analyze the potential sources of identity-theft information using, for example, identity-theft nomenclature.

Oftentimes, compromised PII is exchanged via chat rooms (for example, between identity thieves on Internet Relay Chat (IRC) channels). The dark web data acquisition system 140 can utilize an IRC bot to crawl the Internet in search of chat rooms (for example, IRC channels) that are frequented by identity thieves. The IRC bot can be operable to monitor such chat rooms for identity-theft nomenclature. Furthermore, the IRC bot 101 can be operable to identify and collect compromised PII, uniform resource locators (URLs), references to other IRC chat rooms, and other identity-theft information from such chat rooms.

In addition to or in alternative to scanning the dark websites, the dark web data acquisition system 140 can also acquire compromised PII from other sources. For example, individuals' PIIs may be compromised due to a security breach of an electronic commerce websites. The dark web data acquisition system 140 can acquire the compromised PIIs from the electronic commerce websites. As another example, a user may report an account as stolen to an account issuer. The account issuer can report the stolen account to the dark web data acquisition system 140 and the stolen account (as well as information associated with the stolen account) can thus be flagged as compromised PII.

The compromised PII may be parsed, analyzed, and stored into the dark web data store 142. For example, the identity-theft information collected from the dark websites 120 or other sources may be processed for compromised PII by one or more parsers that recognize common formats for PII (such as for example recognizing a list of text strings as email address or credit card numbers). Advantageously, in some embodiments, the dark web data acquisition system 140 can de-personalize the compromised data. For example, the data web data acquisition system 140 can calculate statistics based on the compromised data and store the statistics in the dark web data store 142 or another persistent storage location. In some embodiments, the statistics may be stored in dedicated lookup tables (as illustrated by aggregated statistics database 224). As an example, one of the statistics can be associated with email domains. The dark web data acquisitions system 130 can calculate the number of hits during a time period for a certain domain name (such as, for example, “email.exampledomain.com”) as found in the data from the dark web sites 120. Advantageously, in some embodiments, such statistics are depersonalized such that they are not specific to a person's PII but nevertheless provide valuable information for the user based on the statistics of other individuals whose email addresses have the same domain name. As further described with reference to data aggregation and synchronizations system 156, in some embodiments, the process of calculating aggregated statistic and/or de-personalizing the compromised data may be performed by the data aggregation and synchronization system 156.

Identity Data Analysis System

The identity data analysis system 150 can include a data aggregation and synchronization system 156, a metrics calculation system 154, report generation system 152, and an identity metrics data store 158. The data aggregation and synchronization system 156 can be configured to acquire compromised PII or statistics (shown in FIG. 3A) stored in the dark web data store 142. The data aggregation and synchronization system 156 can also be configured to acquire report and alert data associated with users of the identity management system 130 by communicating with the identity alerts data store 144.

The data aggregation and synchronization system 156 can synchronize the data acquired from the dark web data store 142 and the identity alerts data store(s) 144 with the data in the identity metrics data store 158. The identity metrics data store 158 can be configured to store master statistics tables (such as, for example, the tables 342 and 344 shown in FIG. 3A) which can include statistics associated with various identity metrics. The identity metrics data store 158 can also be configured to store counterpart tables which correspond to tables in the dark web data store 142 and the identity alerts data store(s) 144. The data aggregation and synchronization system 156 can synchronize the counterpart tables may be synchronized with the corresponding tables in the dark web data store 142 and the identity alerts data store(s) 144. Data in the counterpart tables may be used to calculate statistics associated with various identity metrics and update the master statistics tables stored in the identity metrics data store 158.

Although not shown in FIG. 1, the data aggregation and synchronization system 156 can also be configured to aggregate and synchronize data from other data sources such as, a data store associated with one or more credit bureaus which can store a user's credit data.

The master statistics tables stored in the identity metrics data store 158 can be used by the metrics calculation system 154 to generate personalized metrics for a user. For example, the metrics calculation system can use PII of the user to determine a user's zip code. The metrics calculation system 154 can access the identity metrics data store 158 to determine the number of compromises of PIIs within the user's zip code. The metrics calculation system 154 can return the number of PIIs to the client computing device 110 (for example, via the report generation system 152), which may indicate a likelihood on whether the user is living in a region with a high risk of identity theft. In some embodiments, the metrics calculation system 150 can update the statistics in the identity metrics data store periodically based on the data synchronization with the dark web data store 142 and the identity alerts data store(s) 144. For example, the metrics calculation system 150 can re-calculate the number of compromised incidents for a given email domain after the metrics calculation system 150 receives an update from the data aggregation and synchronization system 158 for the latest data received from the dark web data store 142.

As further described with reference to FIG. 4A, the report generation system 152 can be configured to acquire a report based on data from the dark web data store 142, the identity alerts data store(s) 144, and the identity metrics data store 158. For example, the report may include information on an email domain name and the site where the breach occurs, as well as the number of breaches identified for the same email domain name. The report generation system 152 can obtain the email domain name and the site of breach from the dark web data store 142 and obtain the number of breaches identified for the same email domain name from the identity metrics data store. The report can also include the number of notifications sent to people within the same zip code as the user. The report generations system 152 can obtain information on the number of notifications from the identity alerts data store(s) 144.

As further described with reference to FIGS. 4A and 4B, the report may be communicated to the client computing device 110 in various ways. For example, the report may be in an extensible markup language (XML) format and can be communicated to the client computing device 110 via an API. The report can also be a hypertext markup language (HTML) webpage which can be rendered by a browser of the client computing device 110. In some embodiments, the client computing device 110, the client gateway 162, or the identity monitoring system 164, can automatically populate an HTML template for rendering of the XML report.

Although the report generation system is illustrated as a separate system from the metrics calculation system 154, the report generation system can also be part of the metrics calculation system 154. Further, as described with reference to FIG. 4A, the report generation system 152 can also be part of the client gateway 162.

The identity data analysis system 150 may also be used to perform analytics on the dark web data store 142 or data received from the dark web. For example, the identity data analysis system 150 may determine that there has been a spike in the number of compromised credentials related to a particular domain (for example, Company A), which could be used by the system to send an alert for a potential breach with Company A's data.

Dark Web Data Store and Identity Alerts Data Store(s)

The dark web data store 142 can be configured to store data acquired from the dark web sites 120. The data from dark websites 120 may include information related to security breaches of one or more user's PII. For example, the dark web data store 142 can store chat logs from one or more chat rooms on the dark web. The dark web data store 142 can also be configured to store the results based on the analysis of data acquired from the dark websites 120. For example, with reference to FIG. 2A, the dark web data store 142 can store parsed content 222 (such as, for example, email address or breach site identified from the data acquired from dark web sites). As another example, the dark web data store 142 can include aggregated statistics database 224 configured to store various statistics calculated from dark web data, such as for example, the number of compromises associated with an email domain name, the number of compromises associated with a phone area code, the number of compromises associated with an issuer identification number (also referred to as BIN), or the number of hits associated with a postal code, so on, alone or in combination. Each type of the statistics may be stored in a dedicated look up table. For example, as shown in FIG. 3A, statistics related to emails are stored in the table 322, statistics related to address are stored in the table 324, statistics related to financial cards are stored in the table 326, and statistics related to phone numbers are stored in the table 328.

In addition to or in alternative to storing dark web data, the dark web data store 142 can also be configured to store data from other sources. For example, the dark web data store 142 can store the data relating incidents of security breaches of a user's PII acquired from an electronic commerce website.

The identity alerts data store(s) 144 can be configured to store users' information, such as for example, the user's PII, although in some embodiments, the user's PII may be stored in the dark web data store 142 in addition to or in alternative to the identity alerts data store(s) 144. The identity alerts data store(s) 144 can also be configured to store information associated with reports of the users, such as the content of the reports, the reports themselves, or the metadata of the reports. The metadata of the report can include report statistics. As shown in the report and alert metadata table 332 (in FIG. 3A), the report statistics can include a postal code, record counts for the postal code, the user identifier which the report was sent to, date, and so forth. As shown in FIG. 2A, the content of the reports and/or the reports themselves can be part of the report and alert items 242, while the metadata of the reports can be stored as part of the repot and alert metadata database 224.

Although the disclosures uses the words “data table” or “table”, data stored in various data stores or databases are not limited to a tabular data structure and can include non-relational data stores as well. Other types of data structures, such as, for example, trees, hashes, graphs, blockchain structures, and so on, can also be used to store various data described herein.

Examples of Data Collection

FIG. 2A illustrates an example embodiment of an identity management system 130 for collecting data used to generate personalized metrics. FIG. 2A illustrates three main components: dark web data acquisition system 140, report and alert data acquisition system 230, and identity data analysis system 150 that are involved in the data collection process. These three components can be part of the identity management system 130, even though one or more of these components may be implemented by different computing devices. For example, the report and alert data acquisition system 230 may be implemented on a cloud platform while the dark web data acquisition system 140 and/or the identity data analysis system 150 may can be implemented on a separate platform with dedicated servers.

As described with reference to FIG. 1, the dark web data acquisition system 140 can be configured to scan and retrieve compromised data such as identity-theft information from dark web sites 120, parse the compromised data to generate the parsed content 222, compile aggregated statistics 222 based the compromised data, and insert the parsed content 222 and the aggregated statistics 224 into the dark web data store 142. The dark web data acquisition system 140 can implement the process 280 shown in FIG. 2B to obtain and analyze compromised data.

The report and alert data acquisition system 230 can be configured to access reports and store the reports into the identity alerts data store(s) 144. The reports can be received from client gateway 162 or the identity monitoring system 164. The reports can also be received from the report generation system 152. In some embodiments, the report generation system 152 may be part of the report and alert data acquisition system 230.

The report and alert data acquisitions system 230 can maintain the report and alert metadata database 244 which can be configured to store metadata 332 associated with reports (shown in FIG. 3A). The metadata can include report statistics such as the number of reports, the types of reports sent to the users in a geographic region. The report and alert data acquisition system 230 can implement the process 290 shown in FIG. 2B for gathering reports, storing the reports and metadata associated with the reports.

Although only one report and alert data acquisition system 230 is illustrated in FIG. 2A, the identity management system 130 may utilize multiple instances of the report and alert data acquisition system 230. Each report and alert data acquisition system 230 may be associated with a specific group of individuals who share certain demographic information or may be assigned to a certain geographic region. For example, each report and alert data acquisition system 230 may be associated with individuals having the same zip code. Each report and alert data acquisition system 230 can maintain its own a report and alert metadata database 244 specific to its group of individuals. For example, the report and alert metadata for a report and alert acquisition system 230 may include report statistics specific to the report and alert acquisition system 230. As an example, the report and alert metadata may include the number of reports delivered for a zip code assigned to a certain report and alert acquisition system 230.

In some situations, the report and alert metadata database 244 or the report and alert items 242 for respective report and alert data acquisition system 230 can be maintained without truncations, regardless of whether an individual is actively enrolled in monitoring its identity metrics. In some embodiments, each report and alert data acquisition system 230 can maintain its own identity alerts data store 144 which can include report and alert items 242 and report and alert metadata database 244 specific to the population group assigned to the report and alert data acquisition system 230. In other implementations, an identity alerts data store 144 can include a plurality of report and alert metadata databases 244 and/or report and alert items 242 (which are associated with their respective report and alert data acquisitions systems 230).

The identity data analysis system 150 can be configured to generate personalized metrics for a user based on data from the dark web data store 142 and the identity alerts data store(s) 144. As further described with reference to FIG. 3A, the data aggregation and synchronization system 156 can access data from the dark web data store 142, such as aggregated statistics database 224, and data from the identity alerts data store(s) 144, such as report and alert metadata 332. Such data access can occur on a periodic basis, such as for example, every day, every 12 hours, every week, every month, every two months, every year, and so on, and use either a push or pull technique. The data acquired by the data aggregation and synchronization system 156 can be stored in the identity metrics data store 158 and/or communicated to a metrics calculation system 154 which can statistics related to identity metrics. The statistics related to identity metrics can be calculated based on data in the identity metrics data store 158, the data recently acquired by the data aggregation and synchronization system 156, alone or in combination. The results from the calculation may be stored in the identity metrics data store 158 as part of one or more master statistics tables. The metrics calculation system 154 can retrieve the relevant statistics from the master statistics tables based on the PII of the user and communicate the relevant statistics to the report generation system 152 for inclusion in a report.

In addition to or in alternative to the dark web data store 142 and the identity alerts data store(s) 144, the identity data analysis system 150 can also generate reports based on data received from other sources. For example, the identity data analysis system 150 may receive credit reports from one or more credit bureaus' databases, calculate metrics based on the content of the credit reports, and include metrics calculated from such credit reports (or the content of the credit reports) to be part of the personalized metrics of a user.

Data Collection Processes

FIG. 2B illustrates example embodiments of data collection processes, which include a dark web data acquisition process 280 and a report and alert data acquisition process 290. The process 280 can be implemented by the dark web data acquisition system 140 and the process 290 can be implemented by the report and alert data acquisition system 230, but it is also recognized that the processes can be implemented by other systems.

Dark Web Data Acquisition Process

At block 282, the dark web data acquisition system can scan for and retrieve compromised data from dark webs sites. For example, the dark web data acquisition system can use search engines, web spiders, IRC bots, and key-word matching features to identify and collect identity-theft information, such as, for example, potential sources of compromised PII, the personal information in the compromised PII, etc.

At block 284, the dark web data acquisition system 140 can scan and parse content of the compromised data. For example, the collected identity-theft information may be processed for determining compromised PII by one or more parsers that recognize common formats for PII. A parser may identify token-separated data (for example, tab-delimited data). Similarly, a parser may determine a column type for columns lacking a column header, for example, by analyzing data that is present in particular columns (for example, recognizing a list of text strings as email addresses). Furthermore, a parser may identify multi-line labeled data such as, for example, “first name: John,” and various other labels that may be associated with compromised PII (for example, recognizing “ccn,” “cc” or “credit card” as possible labels for credit-card information). Additionally, by way of a further example, a parser may identify identity-theft information taken from encodings that may be present on cards such as, for example, credit cards, driver's licenses, and the like. The encodings may include, for example, track 1 and track 2 magnetic-stripe data.

Additionally, as part of the parsing process, rules may be enforced that require groups of fields to be present in particular compromised PII before allowing the particular compromised PII to be recorded in block 286. The requirement that groups of fields be present has the benefit of reducing “false positives” within compromised PII. False positives may be considered elements of compromised PII that are not deemed to be sufficiently private or sufficiently important to merit recordation. False positives may be removed from the collected identity-theft information. For example, an email address that is not accompanied by a password may be considered a false positive and not recorded. A rule may be established that requires, for example, a username or email address to be accompanied by a password in order to be recorded.

In some embodiments, a validation process may be implemented to analyze a source of the collected identity-theft information such as, for example, compromised PII, and to facilitate a determination on whether any elements of the compromised PII are false positives. For example, genealogy websites, phone/address lookup websites, and website log files are common sources of false positives. Compromised PII that is mined from such websites may be considered false positives and removed from the collected identity-theft information. Conversely, compromised PII mined, for example, from known hacker websites and websites replete with identity-theft nomenclature, in a typical embodiment, may be protected from identification as false positives.

The compromised data collected form the data websites can be normalized to ensures that the collected identity-theft information such as, for example, compromised PII, is stored (at block 286) according to a standardized format. For example, standardized data structures and attributes may be established for names, credit card numbers, and the like. The normalization process facilitates matching, for example, elements of compromised PII to particular individuals to whom the elements correspond. In that way, reports and alerts based on the compromised PII may be more efficiently and more accurately generated.

At block 286, the dark web data acquisition system 140 can insert parsed content into the dark web data store 142. In some embodiments, the parsed content can be added to the dark web data store after the validation process and the normalization process described above. As described with reference to FIG. 2A, the dark web data acquisition system 140 can also store the compromised data retrieved from the dark web site before the compromised data is parsed, validated, or normalized.

At block 288, the dark web data acquisition system 140 can generate aggregated statistics and store the aggregated statistics in the dark web data store 142. The aggregated statistics can include one or more statistics related to the identity metrics. The aggregated statistics may be specific to a user's PII, such as, for example, the number of hits found for a user's email address in a dark web search. The aggregated statistics may also be depersonalized, such as, for example, the number of hits found for a user's email domain name in a dark web search. The aggregated statistics may be stored in dedicated look up tables, such as, for example, tables storing email data 322, address data 324, card data 325, or phone data 328. In some embodiments, the aggregated statistics are stored in the identity metrics data store 158. Further, in various embodiments, the dark we data acquisition system or the metrics calculation system 154 may calculate the aggregated statistics from the compromised data.

Report and Alert Data Acquisition Process

With reference to the process 290, report and alert data associated with a group of users can also be used to determine personalized statistics. The process 290 can be implemented by the report and alert data acquisition system 230 or another system to compile reports associated with a group of users (which may or may not including the user whose personalized metrics are requested).

At block 292, the report and alert data acquisition system 230 receives a report from a client gateway or an identity monitoring system. In some embodiments, the report can also be received from the report generation system 152. For example, the report generation system 152 can forward a copy of the report to the report and alert data acquisition system 230 before, during, or after sending the report to the computing device 110. The report may include personalized metrics generated for users who subscribed to the periodic monitoring service of their identity metrics. The report may also be created in response to a one time request for personalized metrics. In some embodiments, where multiple instances of report and alert data acquisition system 230 are initiated, the reports obtained by a report and alert data acquisition system may share similar characteristics. For example, the reports may be associated with users having the same geographic characteristics (for example, the same zip code, state, or city).

At block 294, the report and alert data acquisition system 230 can insert the report into an identity alerts data store(s) 144. For example, the report and alert data acquisition system 230 can store the reports sent to a group of individuals or at least a portion of the reports' content into the identity alerts data store(s) 144. As described with reference to FIG. 2A, identity alerts data store(s) 144 may retain its data for a long term (such as, for example, for permanent storage), even though some of the individuals in the group are no longer subscribed to the periodic monitoring of their identity metrics.

At block 296, the report and alert data acquisition system 230 can determine metadata associated with the report. The metadata can include report statistics based on the types of reports and the numbers of reports sent to the users of the identity management system 130. The report statistics may be depersonalized such that individual user's PII will not be revealed in the report sent to the user. Rather, the report statistics may include numbers pertaining to the group of individuals sharing similar characteristics as the user. For example the report statistics may include the number of notifications sent to the individuals in the same geographic area as the user. Some example fields of the metadata are also illustrated in FIG. 3A.

At block 298, the report and alert data acquisition system 230 can store the metadata into a report and alert metadata database. In some embodiments, the metadata may also be retained without truncation even though the individual (or the group of individuals) associated with the metadata is no longer actively subscribed to the identity management system 130.

The blocks in the processes 280 and 290 are for illustrative purposes. The processes 280 and 290 do not have to flow exactly in the order indicated by the arrows shown in FIG. 2B. For example, the process blocks 298 and 294 may be executed in parallel. As another example, the process block 288 may be executed before the process block 286. In various implementations, more or fewer blocks may also be implemented in the processes 280 and 290.

Examples of Statistical Computation for Personalized Metrics

As described herein, the identity data analysis system 150 can use data obtained from dark web data store 142, identity alerts data store(s) 144, and the identity metrics data store 158 to generate a report for personalized metrics of a user. FIG. 3A illustrates an example embodiment of a metrics calculation system for generating personalized metrics of a user. The metrics calculation system 154 shown in FIG. 3A can be part of the identity data analysis system 150.

Dark Web Data Analysis Subsystem

In FIG. 3A, the metrics calculation system 154 includes a dark web data analysis subsystem 352 and a report and alert data analysis subsystem 354. The dark web data analysis subsystem 352 can implement the dark web data analysis process 380 and the report and alert data analysis subsystem 354 can implement the report and alert data analysis process 390 shown in FIG. 3E. Although the dark web data analysis subsystem 352 and the report and alert data analysis subsystem 354 are illustrated as two separate blocks, in some embodiments, these two subsystems may be combined a same system within the metrics calculation system 154. Further, the metrics calculation system 154 may also include other systems (such as data aggregation or synchronization system 156), which are not illustrated in this figure.

The dark web data analysis subsystem 352 can be configured to receive synchronized data from the dark web data store 142. As an example of data synchronization, the identity metrics data store 158 can maintain one or more counterpart tables which correspond to the tables in the dark web data store 142. As shown in FIG. 3A, the dark web data store 142 can be configured to store four tables related to aggregated statistics 224: the email data table 322, the address data table 324, the card data table 326, and the phone data table 328. Each data table can include a respective value, such as email domain, postal code, card BIN, and phone area code. The data tables can further include hits information such as hits date and hits count based on searches and analysis of compromised data from the dark web sites. Although in this example, four data tables are illustrated, fewer or more tables may also exist based on the identity metrics that the metrics calculation system is configured to generate. For example, where the metrics calculation system 154 is configured to calculate metrics related to social security number, the aggregated statistics 224 can also include a data table related to social security number values and hits information. The identity metrics data store 158 can maintain one or more counterpart tables corresponding to the tables in the aggregated statistics database 224. The tables in the aggregated statistics database 224 and the tables in the identity metrics database do not always remain a one-to-one correspondence. For example, the email data table 322 can correspond to two counterpart tables in the identity metrics data store 158, such as, for example, an email domain value data table and an email domain hits table. For other tables, one data table in the aggregated statistics 224 table can correspond to one data table in the identity metrics data store 340, or multiple data tables in the aggregated statistics database 224 can correspond to one data table in the identity metrics data store 340. Further, not all data tables in the dark web data store 142 have a counterpart table in the identity metrics data store 158. For example, the address data table 324, the card data table 326, or the phone data table 328 may not have a counterpart table in the identity metrics data store 158. This may be because the metrics calculation system 154 may not generate personalized metrics using data one or more of these three tables.

The dark web data analysis system can synchronize updates to the aggregated statistics database 224 (as well as updates to individual data tables in the aggregated statistics database 224) periodically or in real-time as the updates occur. Although not shown in FIGS. 3A and 3C, in some embodiments, the data synchronization with the dark web data store 142 can be performed by the data aggregation and synchronization system 156.

The dark web data analysis subsystem 352 can be configured to analyze the synchronized data and compute one or more identity statistics based on the synchronized data. The identity statistics may be stored in corresponding summary identity statistics tables. For example, identity metrics data store 158 can maintain an email domain statistics table 342 which include summary statistics of the email domains. The dark web data analysis subsystem 352 can synchronize data in the email data table 322 (from the dark web data store 142) with the counterpart tables associated with email domains stored in the identity metrics data store 158. The dark web data analysis subsystem 352 can access data in the counterpart tables to calculate summary statistics associated with the email domains. The calculated statistics maybe used to update the email domain statistics table 342. As further described with reference to FIG. 4A, the email domain statistics table 342 may be accessed by the report generation system 152 to generate personalized metrics of a user.

FIG. 3B illustrates an example embodiment of data synchronization between an identity metrics data store and a dark web data store. In this example, the dark web data store 142 can include an email data table 322, among other data. The identity metrics data store 158 can include an email domain value table 362 a and email domain hits table 362 which are counterpart tables of the email data table 322. The identity metrics data store can also include an email domain statistics table 342, among other data.

At (1), the metrics calculation system 154 (such as, for example the dark web data analysis subsystem 352) can synchronize data between email data table and its counterpart tables. For example, the metrics calculation system 154 can read data from the email data table 322 at (1A) and write data obtained from the email data table 322 into the email domain value table 362 a and email domain hits table 362 b at (1B). The metrics calculation system 154 can use a query in a Structured Query Language (SQL) and/or JOIN command to achieve the synchronization.

At (2), the metrics calculation system 154 can use data in the counterpart tables to update the email domain statistics table 342. For example, at (2A), the metrics calculation system 154 can read the data from the email domain value table 362 a and the email domain hits table 362 b. In some embodiments, (2A) may be performed using a GROUP BY and a SUM( ) function in SQL. At (2B), the metrics calculation system 154 can write the results obtained from (2A) into the email domain statistics table 342.

FIG. 3C illustrates an example embodiment of calculating identity statistics based on data from a dark web data store. As in FIG. 3B, the email data table 322 corresponds to two counterpart tables: email domain value table 362 a and email domain hits table 362 b in the identity metrics data store 158. In some embodiments, each top-level domain can be stored only once in the email domain value table 362 (such as, for example, “email.com” is stored only once under the “value” column of the email domain value table 362 a).

At (1), the metrics calculation system 154 can synchronize email data table 322 with the email domain value table 362 a and the email domain hits table 362 b. For example, the domain name can be stored in the email domain value table 362 a while the hits information (for example the hit date and hit counts) can be stored in the email domain hits table 362 b. The email domain hits table 362 can link a hit with a domain name, such as, for example, by referring to the ID of the email domain value in the email domain value table 362 a. In some embodiments, this synchronization process can be performed by the dark web data analysis subsystem 352 or by the data aggregation and synchronization system 156.

At (2), the metrics calculation system 154 can aggregate data stored in the identity metrics data store 158 and use the data in the identity metrics data store 158 to compute statistics related to an identity metric. For example, the dark web data analysis subsystem 352 can execute a process which uses a SQL JOIN command to combine the email domain value table 362 a and the email domain hits table 362, as well as apply SUMO and GROUP BY functions to obtain the results.

At (3), the metrics calculation system 154 can insert or incorporate the computation from (2) into the email domain statistics table 342. The email domain statistics table 342 can be consulted by a report generation system 152 to generate a personalized metrics report for a user as described in FIG. 4A.

Report and Alert Data Analysis Subsystem

The metrics calculation system 154 can also include a report and alert data analysis subsystem 354. The report and alert data analysis subsystem 352 can be configured to synchronize data with identity alerts data store(s) 144, analyze report data, and compute identity statistics based on data acquired from the identity alerts data store(s).

The identity metrics data store 158 can include one or more counterpart table corresponding to data in the identity alerts data store(s). For example, the identity metrics data store 158 can include a geographic record which can be a counterpart table corresponding to the report and alert metadata in the identity alert data store 144. Where multiple instances of the report and alert data acquisitions system 230 are instantiated, there may be multiple identity alerts data store(s) 144 or multiple reports and alert metadata table 332, each corresponding to an instance of the report and alert data acquisition system 230. However, one geographic record table may be maintained in the identity metrics data store 158 which can be configured to synchronize data from different instances of the report and alert data acquisition system 230. The report and alert data analysis subsystem 354 can update the one or more counterpart table periodically or in real time as updates in the identity alerts data store(s) 144 occur, and can use similar techniques as the dark web data analysis subsystem 352 to perform the synchronization. Although not shown in FIG. 3A, in some embodiments, the data synchronization with the identity alerts data store(s) 144 can be performed by the data aggregation and synchronization system 156.

The report and alert data analysis subsystem 354 can be configured to analyze the synchronized data and compute one or more notification related statistics based on the synchronized data. The report statistics may be stored in a corresponding summary report statistics table. For example, identity metrics data store 158 can maintain a geographical statistics table 344 which can include summary statistics related to report statistics for various geographic locations. The report and alert data analysis subsystem 354 can synchronize data in report and alert metadata tables 332 (from various instances of the identity alerts data store(s) 144) with the counterpart table (for example, the geographic record table) stored in the identity metrics data store 158. The report and alert data analysis subsystem 354 can access the data in the counterpart table to calculate summary report statistics associated with geographic regions. The calculated statistics may be used to update the geographical statistics table 344. In some embodiments, the metrics calculation system 154 can use a portion of the information the counterpart table to calculate summary report statistics. For example, the identity metrics data store 158 can maintain a zip code records table which serves as a counterpart table to the report and alert metadata tables 332 associated with respective instances of the report and alert data acquisition systems 230. The zip code records table can include a column on instance identifier (ID). The instance ID can keep track the report and alert acquisition system 230 from which the metadata is acquired. However, when computing the summary report statistics associated with the geographic regions, the metrics calculations system 154 can ignore the data in instance id column so that the statistics across multiple regions can be summed together. As further described with reference to FIG. 4A, the geographical statistics table 344 may be accessed by the report generation system 152 to show identity theft related statistics in the user's geographical region.

FIG. 3D illustrates an example embodiment of synchronizing the identity metrics database with a plurality of report and alert metadata databases. In this example, the zip code records table 378 in the identity metrics data store 158 can be a counterpart table of a plurality of report and alert metadata tables 332 (for example, the report and alert metadata table A 332 a, the report and alert metadata table B 332 b, and the report and alert metadata table C 332 c, and so on). The report and alert metadata tables 332 a-332 c can be stored in respective report and metadata database 224 where each report and metadata database 224 can correspond to a report and alert data acquisition system 230.

The zip code records table 372 can include fields such as, for example, date, zip code, record type ID, and instance ID. The record type ID can indicate the type of record. The record type ID can be linked to the ID in the record type table 376 a. Some example types of records can include dark web security alerts, sex offender notifications, court records notifications, SSN trace notification, non-credit loan notification, change of address notification, and early warning alerts. These record types can be used to generate personalized metrics which include statistics in one or more types of the records (as shown in FIG. 4B). The instance ID field in the zip code records table 378 can be derived from a synchronization configuration file 376 b. The synchronization configuration file can set forth the instance information for the identity alerts data store(s) 144 or the report and alert data acquisition system(s) 230. Some example configuration can include which host and port number an instance of the identity alerts data store(s) 144 or the report and alert data acquisition system(s) 230 is corresponding to.

In some embodiments, the record type ID values and the instance ID field in the zipcode records table can be obtained at the initialization phase, such as, for example, during the initialization of the zip code records table 378, the identity metrics data store 158, the metrics calculation system 154, the identity data analysis system 150, the identity management system 130, the identity alerts data store(s) 144, or other components of the identity management system 130, alone or in combination.

At (1), the metrics calculation system 154 can synchronize with each instance of the report and alert metadata tables 332. The synchronization process can be performed, for example, using SQL (such as the SUMO and GROUP By function).

At (2), the result obtained from each instance can be written to the zip code records table 378 for storage. The zip code table 378 can be accessed by the metrics calculation system 154 for generating report statistics which can be stored into the geographical statistics table 344. The geographical statistics table 344 can be accessed by the report generation system 152 to generate a personalized metrics report including report statistics in a user's geographic region.

In some embodiments, the synchronization in (1) and (2) can be performed by the data aggregation and synchronization system 156. Although various examples in FIGS. 3B-3D refer to SQL commands or functions, the various methods and steps in these figures do not have to be performed by SQL. Other languages, such as Java, Ruby, Hypertext Pre-Processor (PHP), C++, or any language supported by the dark web data store 142, the identity alerts data store(s) 144, or the identity metrics data store 158, can also be used to implement the techniques described herein.

Statistical Computation Processes

FIG. 3E illustrates example embodiments of statistical computation processes, which include a dark web data analysis process 380 and a report and alert data analysis process 390. The process 380 can be implemented by the dark web data analysis subsystem 352 and the process 290 can be implemented by the report and alert data analysis subsystem 354, but could also be implemented by other systems.

Dark Web Data Analysis Process

With reference to the dark web data analysis process 380, at block 382, the dark web data analysis subsystem 352 can periodically synchronize with the dark web data store 142. For example, the dark web analysis subsystem can synchronize various aggregated statistics tables (for example, tables 322, 324, 326, 328) with the counterpart tables in the identity metrics data store 158. For example, the dark web data analysis subsystem 352 can periodically synchronize the email data table 322 stored in the dark web data store 142 with its counterparts tables (for example, email domain value table and email domain hits table) stored in the identity metrics data store 158.

At block 384, the dark web data analysis subsystem 352 updates the counterpart tables stored in the identity metrics data store based on the updates. For example, the updates may include updates to aggregated statistics due to analysis (for example, by the dark web data acquisition system 140) of recently acquired compromised data. In some embodiments, the updates may be appended to one or more counterpart tables. With reference to the example in the preceding paragraph, the dark web data analysis subsystem 352 can append new email domain names (acquired from updates to the email data table 322) to the email domain value table.

At block 386, the dark web data analysis subsystem 352 can compute identity statistics. The identity statistics can be calculated based on data in the counterpart tables or updates received from the dark web data store 142. Continuing with the above-mentioned example, one of the identity statistics may be email domain statistics, which can include the hit date of a domain name (for example, found in compromised data), the total number of hits historically or within a certain time period (for example, within the past 60 days), the number of distinct days where hits are found, and so on.

At block 388, the dark web data analysis subsystem 352 can update a summary identity statistics table (such as the email domain statistics table 342) with the computed identity statistics. For example, the number of hits in the past 60 days of an email domain name in the email domain statistics table may be updated to reflect the most recent calculation occurred at the block 386.

Report and Alert Data Analysis Process

With reference to the report and alert data analysis process 390, at block 392, the report and alert data analysis subsystem 354 can periodically synchronize with one or more identity alerts data store(s). For example, there may be multiple instances of the report and alert data acquisition system 230 and thus there may be multiple identity alerts data stores 144 or report and alert metadata databases 244. For example, each report and alert metadata database 244 may maintain a report and alert metadata table 332 corresponding to statistics associated with a particular report and an alert data acquisition system 230.

The report and alert data analysis subsystem 354 can acquire updates to the one or more alerts data store and update counterpart table stored in the identity metrics data store at block 394. For example, even though there may be multiple identity alerts data stores or report and alert metadata tables, there may be one counterpart table corresponding to these identity alerts data stores or report and alert metadata tables. Thus, the report and alert data analysis subsystem 354 can combine data from the multiple identity alerts data stores or report and alert metadata tables and update the counterpart table accordingly. In some implementations, the report and alert data analysis system 354 can append updates to the counterpart table.

At block 396, the report and alert data analysis subsystem 354 can compute report statistics. The report statistics can be calculated based on the data in the counterpart table or updates received from the one or more identity alerts data store. As an example, the report statistics may be based on geographic regions, which can include the number of hits found for a geographic location, earliest hit date or latest hit date, the total number of hits historically or within a certain time period (for example, within the past 60 days), and so on.

At block 398, the report and alert data analysis subsystem 354 can update a summary report statistics table (such as the geographical statistics table 344) with the computed report statistics. For example, the number of hits or reports delivered historically for a zip code stored in the geographical statistics table 344 may be updated to reflect the most recent calculation occurred at the block 396.

Examples of Report Generation

As described with reference to FIG. 1, the identity management system 130 can receive a request from a client computing device 110 and return a report comprising personalized identity metrics to the client computing device 110. FIG. 4A illustrates an example embodiment of a data flow diagram for report generation. The example data flow diagram 400 can involve a client computing device 110, a client gateway 162, a report generation system 152, a dark web data store 142, an identity alerts data store 144, and an identity metrics data store 158.

At (1), the client computing device 110 can send a request to the client gateway 162 for a report of a user's personalized metrics. The request can include the user's PII such as, for example, one or more of the user's email addresses (for example, “username1@email.com^(”) and “username2@anotheremail.com”), zip codes associated with the user's addresses (for example, the user's address in the past 2 years), the user's phone numbers (for example, the user's work phone and home phone numbers), and the financial card's BIN numbers (for example, the first five or six digits of a user's credit card numbers). The request may be in an XML format, although other formats such as, for example, txt or HTML are also possible. In some implementations, the request may also include authentication mechanisms such as password and username to increase the security of the user's personalized metrics.

The client gateway 162 can parse the request to extract the user's PII. At (2), the client gateway 162 can communicate the request to a report generation system 152 for generating a report based on the user's PII. Although in this example, the client gateway 162 communicates the request for report to the report generation system 152, in various implementations, the client gateway 162 can also communicate the request to other components of the identity data analysis system 150.

At (3), the report generation system 152 can communicate with the dark web data store 142 to obtain information specific to the user. For example, the report generation system 152 can use the received PII to obtain a report from the dark web data store 142. As the example shown in the code block 430, the report can include whether the user's email address was found on websites which had security breaches and the number of hits (for example, found in dark web sites 120) over a certain time period (for example, last 60 days or since when the user is subscribed to the identity management system 130).

At (4), the report generation system 152 can communicate with the identity metrics data store 158 to obtain summary statistics based on the user's PII. For example, as shown in the code block 440, the summary statistics can include total number of hits for all of the user's email addresses found in dark web data. Some of the summary statistics can be depersonalized, which can be calculated based on data for users having similar PIIs. For example, the report generation system 152 can include summary statistics related to the number of security breaches associated with an email domain over a time period. As shown in the example code block 440, the historical total number of hits for the email domain name “email.com” is around 500 million and the total number of hits in the past 60 days for “email.com” is about 15 million. These total numbers of hits can include hits associated with other user's email addresses having the same email domain name. The report generation system 152 can query the email domain statistics table 342 to obtain the summary statistics related to the email domain.

At (5), the report generation system 152 can communicate with the identity alerts data store(s) 144 to retrieve report statistics. As an example, the code block 450 shows that total number of reports sent to all users as well as the number of reports sent to users within a given zip code or a given state in the past 60 days and historically. As further described with reference to FIG. 4B, the report generation system 152 can also obtain statistics related to other types of notifications from the identity alerts data store(s) 144. Some example types of notifications include: criminal records, non-credit loans, change of address, financial account takeover, dark web related breaches, and so forth. These statistics can also be broken down by geographic regions. Although in this example, the report generation system 152 obtains report statistics from the identity alerts data store(s) 144, in some implementations, the report generation system 152 can also communicate with the identity metrics data store 158 to obtain at least a portion of the report statistics. For example, the report generation system 152 can obtain report statistics specific to user's geographic region by consulting the geographic statistics table 344 in the identity metrics data store 158 while obtain national report statistics from the identity alerts data store(s).

The report generation system 152 can generate the report compiling data obtained from (3) through (5). The report may be in an XML format (or any other suitable format). At (6), the report generation system 152 can provide the report to the client gateway 162 which, at (7), will pass the report to the client computing device 110 for display to a user. The report can be delivered in an email, a text message, or other types of messaging system. The report can also be rendered on a web portal.

Examples of Reports

FIG. 4B illustrates example embodiments of reports delivered to a client computing device. FIG. 4B illustrates two user interface screens 460 and 480. The user interface screen 460 may be part of an email message while the user interface screen 480 may be a user interface element (for example, a widget or a banner) displayed on a web portal.

The user interface screen 460 shows a monthly report. This monthly report may be generated as a result of a user's subscription to an identity monitoring service offered by the identity management system 130. For example, the report generation system 152 can automatically gather user specific data, summary identity statistics, and summary report statistics every month and deliver a monthly report to the user.

The monthly report in the user interface screen 460 shows a monthly summary of notifications (shown by the user interface element 462) which may include the number of notifications delivered by the identity management system 130 to the user. The monthly report can also include report statistics associated with the user's geographic region. As shown by the user interface element 464, report statistics can include the number of notifications related to the change of addresses, court records (for example, filings of court cases), dark web security alerts (for example, alerts regarding unauthorized use of the user's PII or potential breach of the user's PIM financial account takeovers, non-credit loans, sex offenders (for example, sex offenders identified as associated with the user's geographic region), and/or social security number traces.

Once receiving the report, the user can actuate the user interface element 466 to review further information of the report on a web portal. The user interface element 480 shows a user interface element on a web portal. This user interface element 480 shows summary statistics (as indicated by the arrow 482) related to the total number of notifications sent to all U.S. subscribers of the identity management system 130 in the past 60 days. The user interface element 480 also shows the summary report statistics (as shown by the arrow 484) in the user's city.

Example Processes for Generating and Monitoring Identity Metrics

The identity data management system 130 may implement processes for identity metrics analysis as well as an identity monitoring process.

Identity Metrics Analysis Process

FIG. 5A illustrates an example embodiment of an identity metrics analysis process. The process 500 can be performed by the identity data analysis system 150 described herein

At block 520, the identity data analysis system 150 can receive a request for a user's identity metrics. The identity metrics may include personalized metrics associated with a user's PII. The personalized metrics information specific to user's PII or depersonalized statistics based on information from a group of users. The request may be received from a client gateway 162, an identity monitoring system 164, or directly from a client computing device 110. The request may include a user's PII which the identity data analysis system can use to generate personalized metrics.

At block 530, the identity data analysis system 150 can access data from at least one of a dark web data store, an identity alerts data store 144, or an identity metrics data store 158. In some implementations, the identity metrics data store 158 can periodically synchronize with the dark web data store 142 and the identity alerts data store(s) 144 to include data in the dark web data store 142 and the identity alerts data store 144. Thus, the identity data analysis system 150 can communicate with the identity metrics data store 158 to acquire compromised PII, reports information, as well as aggregated statistics. In other implementations, the identity data analysis system 150 can communicate with the dark web data store 142 to acquire compromised PII, communicate with one or more identity alerts data store 144 to acquire information of various reports, and communicate with the identity metrics data store to retrieve aggregated statistics. In some situations, a request for user's identity metrics can cause the identity data analysis system 150 to synchronize the data in the identity metrics data store 158 with the dark web data store 142 and one or more identity alerts data store 144. For example, the request may trigger the data aggregation and synchronization system 156 to access an identity alerts data store report information of those individuals having similar demographic characteristics as the user.

At block 540, the identity data analysis system 150 can determine identity metrics related to the PII based on the accessed data. For example, the identity data analysis system 150 can determine the email domain associated with the user's email address. The identity data analysis system can find the number of compromises associated with the same email domain. As another example, the identity data analysis system can find the number of reports sent to users in the same geographical area. In some embodiments, the operations in the blocks 530 and 540 may be combined. For example, the identity data analysis system (or the metric calculation system 154 of the identity data analysis system 150) can execute a query including a portion of the PII and to retrieve the identity metrics associated with the user. For example, the identity data analysis system can query the identity metrics data store 340 for the statistics associated with the user's geographical region.

At block 550, the identity data analysis system 150 can compile the identity metrics into a report. The report may include various sections, such as a user specific section, a statistics section, a geographically related alerts section, and so on. Compromised data from the dark web may be used to populate the user specific section. For example, if the user's email address is flagged as compromised based on the analysis of the dark web data, the identity data analysis system can input the user's email address as well as the site where the compromise was discovered into the user specific portion of the report. As another example, the aggregated statistics may be accessed form the identity metrics data store 158 and populate the statistics section of the report. For example, statistics regarding the number of compromised email address having the same domain name as the user's email address may be added to the statistics portion of the report. The geographically related alerts section may be populated based on data from the identity alerts data store. Some example, data may include the number of notifications provided by the identity management system 130 to people in the same city as the user. As described with reference to FIG. 4A, the report generation system 152 of the identity data analysis system 150 may be configured to perform the process at block 550.

At block 560, the report can be communicated to a requesting entity for display to the user. As described with reference to FIG. 1, the requesting entity may be an identity monitoring service, a company or a person interested in learning a user's identity metrics (for example, an employer), or the user himself. The report may be in an XML format and may be communicated via an API. The identity data analysis stem 150 can communicate the report to the requesting entity via a client gateway 162, an identity monitoring system 164 or directly to the client computing device 110.

Identity Monitoring Process

FIG. 5B illustrates an example embodiment of an identity monitoring process. The process 510 may be performed by the identity monitoring system 164 alone or in combination with user identity registration system 166 or the identity data analysis system 150, as well as another system.

At block 502, the identity monitoring system 164 can receive a subscription request associated with a user. The subscription request may include a request to enroll a user into a periodic monitoring service of the user's identity metrics.

At block 504, the identity monitoring system 164 can determine the user's PII and store the PII in a data store. For example, the subscription request may include a user's PII and the identity monitoring system can parse the request to identify the user's PII and store the identified PII into a data store (such as, for example, the identity alerts data store 144). In some embodiments, the identity monitoring system can acquire additional PII based on the PII in the user's request. For example, the PII in the user's request may include the user's name and the social security number, the identity monitoring system can use this PII in the user's request to obtain the user's financial account information, the user's phone number, or the address information by communicating with another database (such as, for example, a database associated with a credit bureau). The additional PII can also be stored in the data store together with the PII received in the subscription request.

At block 506, the identity monitoring system 164 can periodically generate a request to invoke the process 500 in FIG. 5A. For example, the identity monitoring system can periodically request identity metrics using the user's PII (which can include the PII received form the subscription request alone or in combination with the additional PII acquired). In some embodiments, once the user is subscribed to the periodic monitoring service, identity data analysis system can periodically generate a report in the form of an alert for the user. For example, for every two months, the identity data analysis system can send a report including personalized metrics in the past 60 days. This alert may be automatically generated without the performing the process block 506.

Example Computing System Implementation and Architecture

FIG. 6 illustrates a general architecture of a computing system for processing attributes and implementing various other aspects of the present disclosure. Many or all of the components of the computing system shown in FIG. 6 may be included in the various computing devices and systems discussed herein. The computing system may include, for example, a personal computer (such as, for example, IBM, Macintosh, Microsoft Windows compatible, OS X compatible, Linux/Unix compatible, or other types of computing systems, alone or in combination), a server, a workstation, a laptop computer, a smart phone, a smart watch, a personal digital assistant, a kiosk, a car console, a tablet, or a media player. In one embodiment, the computing system's processing system 600 includes one or more central processing units (CPU) 612, which may each include a conventional or proprietary microprocessor specially configured to perform, in whole or in part, one or more of the features described above. The processing system 600 further includes one or more memory 618, such as random access memory (RAM) for temporary storage of information, one or more read only memory (ROM) for permanent storage of information, and one or more mass storage device 603, such as a hard drive, diskette, solid state drive, or optical media storage device. A data store 621 may also be included. In some implementations, the data store 621 may be designed to handle large quantities of data and provide fast retrieval of the records. To facilitate efficient storage and retrieval, the data store 621 may be indexed using one or more of compressed data, identifiers, or other data, such as that described above.

Typically, the components of the processing system 600 are connected using a standards-based bus system 624. In different embodiments, the standards-based bus system 624 could be implemented in Peripheral Component Interconnect (PCI), Microchannel, Small Computer System Interface (SCSI), Industrial Standard Architecture (ISA) and Extended ISA (EISA) architectures, for example. In addition, the functionality provided for in the components and modules of processing system 600 may be combined into fewer components and modules or further separated into additional components and modules.

The processing system 600 is generally controlled and coordinated by operating system software, such as Windows XP, Windows Vista, Windows 7, Windows 8, Windows 10, Windows Server, Unix, Linux, SunOS, Solaris, iOS, MAC OS X, Blackberry OS, Android, or other operating systems. In other embodiments, the processing system 600 may be controlled by a proprietary operating system. The operating system is configured to control and schedule computer processes for execution, perform memory management, provide file system, networking, I/O services, and provide a user interface, such as a graphical user interface (GUI), among other things. The GUI may include an application interface and/or a web-based interface including data fields for receiving input signals or providing electronic information and/or for providing information to the user in response to any received input signals. A GUI may be implemented in whole or in part using technologies such as HTML, Flash, Java, .net, web services, and RSS. In some implementations, a GUI may be included in a stand-alone client (for example, thick client, fat client) configured to communicate (for example, send or receive data) in accordance with one or more of the aspects described.

The processing system 600 may include one or more commonly available input/output (I/O) devices and interfaces 615, such as a keyboard, stylus, touch screen, mouse, touchpad, and printer. In one embodiment, the I/O devices and interfaces 615 include one or more display devices, such as a monitor, that allows the visual presentation of data to a user. More particularly, a display device provides for the presentation of GUIs, application software data, and multimedia presentations, for example. The processing system 600 may also include one or more multimedia devices 606, such as speakers, video cards, graphics accelerators, and microphones, for example.

In the embodiment of FIG. 6, the I/O devices and interfaces 615 provide a communication interface to various external devices. The processing system 600 may be electronically coupled to one or more networks, which comprise one or more of a LAN, WAN, cellular network, satellite network, and/or the Internet, for example, via a wired, wireless, or combination of wired and wireless communication link. The networks communicate with various computing devices and/or other electronic devices via wired or wireless communication links.

In some embodiments, information may be provided to the processing system 600 over a network from one or more data sources. The data sources may include one or more internal and/or external data sources. In some embodiments, one or more of the databases or data sources may be implemented using a relational database, such as Sybase, Oracle, CodeBase and Microsoft® SQL Server as well as other types of databases such as, for example, a flat file database, a non-relational database, an entity-relationship database, and object-oriented database, and/or a record-based database.

In general, the word “module,” as used herein, refers to logic embodied in hardware or firmware, or to a collection of software instructions, possibly having entry and exit points, written in a programming language, such as, for example, Java, Lua, C, or C++. A software module may be compiled and linked into an executable program, installed in a dynamic link library, or may be written in an interpreted programming language such as, for example, BASIC, Perl, or Python. It will be appreciated that software modules may be callable from other modules or from themselves, and/or may be invoked in response to detected events or interrupts. Software modules configured for execution on computing devices may be provided on a computer readable medium, such as a compact disc, digital video disc, flash drive, or any other tangible medium. Such software code may be stored, partially or fully, on a memory device of the executing computing device, such as the processing system 600, for execution by the computing device. Software instructions may be embedded in firmware, such as an EPROM. It will be further appreciated that hardware modules may be comprised of connected logic units, such as gates and flip-flops, and/or may be comprised of programmable units, such as programmable gate arrays or processors. The modules described herein are preferably implemented as software modules. They may be represented in hardware or firmware. Generally, the modules described herein refer to logical modules that may be combined with other modules or divided into sub-modules despite their physical organization or storage.

In the example of FIG. 6, the modules 609 may be configured for execution by the CPU 612 to perform, in whole or in part, any or all of the process discussed above, such as those shown in FIGS. 2B, 3E, 5A, and 5B. The processes may also be performed by one or more virtual machines. For example, the processes may be hosted by a cloud computing system. In some embodiments, one or more components of the processing system 600 may be part of the cloud computing system. Additionally or alternatively, the virtualization may be achieved at the operating system level. For example, the one or more processes described herein may be executed using application containerization. The one or more processes may also be implemented on a Lambda architecture designed to handle mass quantities of data by taking advantage of the batch processing and the stream processing.

It is to be understood that not necessarily all objects or advantages may be achieved in accordance with any particular embodiment described herein. Thus, for example, those skilled in the art will recognize that certain embodiments may be configured to operate in a manner that achieves or optimizes one advantage or group of advantages as taught herein without necessarily achieving other objects or advantages as may be taught or suggested herein.

All of the processes described herein may be embodied in, and fully automated, via software code modules executed by a computing system that includes one or more computers or processors. In some embodiments, at least some of the processes may be implemented using virtualization techniques such as, for example, cloud computing, application containerization, or Lambda architecture, etc., alone or in combination. The code modules may be stored in any type of non-transitory computer-readable medium or other computer storage device. Some or all the methods may be embodied in specialized computer hardware.

Many other variations than those described herein will be apparent from this disclosure. For example, depending on the embodiment, certain acts, events, or functions of any of the algorithms described herein can be performed in a different sequence or can be added, merged, or left out altogether (for example, not all described acts or events are necessary for the practice of the algorithms). Moreover, in certain embodiments, acts or events can be performed concurrently, for example, through multi-threaded processing, interrupt processing, or multiple processors or processor cores or on other parallel architectures, rather than sequentially. In addition, different tasks or processes can be performed by different machines and/or computing systems that can function together.

The various illustrative logical blocks and modules described in connection with the embodiments disclosed herein can be implemented or performed by a machine, such as a virtual machine, a processing unit or processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A processor can be a microprocessor, but in the alternative, the processor can be a controller, microcontroller, or state machine, combinations of the same, or the like. A processor can include electrical circuitry configured to process computer-executable instructions. In another embodiment, a processor includes an FPGA or other programmable device that performs logic operations without processing computer-executable instructions. A processor can also be implemented as a combination of computing devices, for example, a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration. Although described herein primarily with respect to digital technology, a processor may also include primarily analog components. For example, some or all of the signal processing algorithms described herein may be implemented in analog circuitry or mixed analog and digital circuitry. A computing environment can include any type of computer system, including, but not limited to, a computer system based on a microprocessor, a mainframe computer, a digital signal processor, a portable computing device, a device controller, or a computational engine within an appliance, to name a few.

Conditional language such as, among others, “can,” “could,” “might” or “may,” unless specifically stated otherwise, are understood within the context as used in general to convey that certain embodiments include, while other embodiments do not include, certain features, elements and/or steps. Thus, such conditional language is not generally intended to imply that features, elements and/or steps are in any way required for one or more embodiments or that one or more embodiments necessarily include logic for deciding, with or without user input or prompting, whether these features, elements and/or steps are included or are to be performed in any particular embodiment.

Disjunctive language such as the phrase “at least one of X, Y, or Z,” unless specifically stated otherwise, is understood with the context as used in general to present that an item, term, etc., may be either X, Y, or Z, or any combination thereof (for example, X, Y, and/or Z). Thus, such disjunctive language is not generally intended to, and should not, imply that certain embodiments require at least one of X, at least one of Y, or at least one of Z to each be present.

Any process descriptions, elements or blocks in the flow diagrams described herein and/or depicted in the attached figures should be understood as potentially representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or elements in the process. Alternate implementations are included within the scope of the embodiments described herein in which elements or functions may be deleted, executed out of order from that shown, or discussed, including substantially concurrently or in reverse order, depending on the functionality involved as would be understood by those skilled in the art.

Unless otherwise explicitly stated, articles such as “a” or “an” should generally be interpreted to include one or more described items. Accordingly, phrases such as “a device configured to” are intended to include one or more recited devices. Such one or more recited devices can also be collectively configured to carry out the stated recitations. For example, “a processor configured to carry out recitations A, B and C” can include a first processor configured to carry out recitation A working in conjunction with a second processor configured to carry out recitations B and C.

It should be emphasized that many variations and modifications may be made to the above-described embodiments, the elements of which are to be understood as being among other acceptable examples. All such modifications and variations are intended to be included herein within the scope of this disclosure. 

What is claimed is:
 1. A identity security system, the identity security system comprising: a non-transitory data storage configured to store computer executable instructions for an identity data analysis system; a dark web data store configured to store compromised data related to users' personal identifying information (PII) and aggregated statistics calculated based on the compromised data; a plurality of identity alerts data stores configured to store alerts sent to the users and metadata associated with the alerts; an identity metrics data store configured to store depersonalized summary statistics; and a hardware processor programmed to execute the computer executable instructions in the non-transitory data storage to cause the identity security system to: receive an instruction to generate an alert comprising personalized metrics of a user, wherein the instruction comprises PII of the user; query a dark web data store with the PII of the user to acquire information on security compromises related to the PII of the user; query an identity alerts data store with geographic location information of the user to obtain report statistics associated with a geographic region of the user; query an identity metrics database to obtain the depersonalized summary statistics, wherein the depersonalized summary statistics are generated based on a group of individuals whose PIIs overlaps at least with a portion of the user's PII; generate an alert which comprises information on security compromises related to the PII of the user, report statistics associated with a geographic region of the user, and the depersonalized summary statistics; and deliver the alert to a client computing device via a network, wherein compromised PIIs are acquired from dark web sites using one or more automated data collection processes.
 2. The identity security system of claim 1, wherein the identity metrics data store is further configured to store counterpart tables corresponding to the aggregated statistics in the a dark web data store and the metadata in the plurality of identity alerts data stores, and the hardware processor is further programmed to periodically synchronize the counterpart tables with data in the dark web data store and the identity alerts data store.
 3. The identity security system of claim 2, wherein the counterpart tables comprise a first counterpart table and a second counterpart table each of which corresponds to an aspect of a data table in the dark web data store.
 4. The identity security system of claim 2, wherein the counterpart tables comprise a third counterpart table which corresponds to metadata of the plurality of identity alerts data store.
 5. The identity security system of claim 2, wherein the hardware processor is programmed to update the summary statistics in the identity metrics data store based at least partly on updates to the counterpart tables in the periodic synchronization.
 6. The identity security system of claim 1, wherein the alert comprises a report in an extensible markup language format.
 7. A method for protecting identity security, the method comprising: receiving an instruction to generate an alert comprising personalized metrics of a user, wherein the instruction comprises PII of the user; querying a dark web data store with the PII of the user to acquire information on security compromises related to the PII of the user; querying an identity alerts data store with geographic location information of the user to obtain report statistics associated with a geographic region of the user; querying an identity metrics database to obtain depersonalized summary statistics, wherein the depersonalized summary statistics are generated based on a group of individuals whose PIIs overlaps at least with a portion of the user's PII; generating an alert which comprises information on security compromises related to the PII of the user, report statistics associated with a geographic region of the user, and the depersonalized summary statistics; and delivering the alert to a client computing device via a network, wherein compromised PIIs are acquired from dark web sites using one or more automated data collection processes.
 8. The method of claim 7, wherein the method further comprises periodically synchronizing counterpart tables in the identity metrics data store with data in the dark web data store and the identity alerts data store.
 9. The method of claim 8, wherein the counterpart tables comprise a first counterpart table and a second counterpart table each of which corresponds to an aspect of a data table in the dark web data store.
 10. The method of claim 8, wherein the counterpart tables comprise a third counterpart table which corresponds to metadata of a plurality of identity alerts data store.
 11. The method of claim 8, wherein the method further comprises updating the summary statistics in the identity metrics data store based at least partly on updates to the counterpart tables in the periodic synchronization.
 12. The method of claim 7, wherein the alert comprises a report in an extensible markup language format.
 13. Non-transitory computer readable medium storing computer executable instructions thereon, the computer executable instructions when executed cause an identity security system to: receive an instruction to generate an alert comprising personalized metrics of a user, wherein the instruction comprises PII of the user; query a dark web data store with the PII of the user to acquire information on security compromises related to the PII of the user; query an identity alerts data store with geographic location information of the user to obtain report statistics associated with a geographic region of the user; query an identity metrics database to obtain depersonalized summary statistics, wherein the depersonalized summary statistics are generated based on a group of individuals whose PIIs overlaps at least with a portion of the user's PII; generate an alert which comprises information on security compromises related to the PII of the user, report statistics associated with a geographic region of the user, and the depersonalized summary statistics; and deliver the alert to a client computing device via a network, wherein compromised PIIs are acquired from dark web sites using one or more automated data collection processes.
 14. The non-transitory computer readable medium of claim 13, wherein the computer executable instructions further causes the identity security system to periodically synchronize counterpart tables in the identity metrics data store with data in the dark web data store and the identity alerts data store.
 15. The non-transitory computer readable medium of claim 14, wherein the counterpart tables comprise a first counterpart table and a second counterpart table each of which corresponds to an aspect of a data table in the dark web data store.
 16. The non-transitory computer readable medium of claim 14, wherein the counterpart tables comprise a third counterpart table which corresponds to metadata of a plurality of identity alerts data store.
 17. The non-transitory computer readable medium of claim 14, wherein the computer executable instructions further causes the identity security system to update the summary statistics in the identity metrics data store based at least partly on updates to the counterpart tables in the periodic synchronization.
 18. The non-transitory computer readable medium of claim 13, wherein the alert comprises a report in an extensible markup language. 